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Abstract 

We develop new techniques for rounding packing integer programs using iterative random¬ 
ized rounding. It is based on a novel application of multidimensional Brownian motion in K". 
Let X G [0,1]" be a fractional feasible solution of a packing constraint Ax <1, A G {0, 
that maximizes a linear objective function. The independent randomized rounding method of 
Raghavan-Thompson rounds each variable Xi to 1 with probability Xi and 0 otherwise. The 
expected value of the rounded objective function matches the fractional optimum and no con¬ 
straint is violated by more than D( dgioAt )' contrast, our algorithm iteratively transforms 

X to X G {0,1}" using a random walk, such that the expected values of ijs are consistent with 
the Raghavan-Thompson rounding. In addition, it gives us intermediate values x' which can 
then be used to bias the rounding towards a superior solution. 

The reduced dependencies between the constraints of the sparser system can be exploited 
using Lovasz Local Lemma. Using the Moser-Tardos’ constructive version, x' converges to x in 
polynomial time to a distribution over the unit hypercube T-Ln = {0,1}" such that the expected 
value of any linear objective function over T-Ln equals the value at x. 

For m randomly chosen packing constraints in n variables, with k variables in each inequality, 
the constraints are satisfied within 0{ high probability where p is the 

ratio between the maximum and minimum coefficients of the linear objective function. For 
example, when m,k = y/n and p = polylog{n), this yields 0(loglogn/logloglogn) error for 
polylogarithmic weighted objective functions that significantly improves the error 

incurred by the classical randomized rounding method of Raghavan and Thompson |RT87| . 

Further, we explore trade-offs between approximation factors and error, and present appli¬ 
cations to well-known problems like circuit-switching, maximum independent set of rectangles 
and hypergraph ^-matching. Our methods apply to the weighted instances of the problems and 
are likely to lead to better insights for even dependent rounding. 
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1 Introduction 


Many combinatorial optimization problems can be modeled using a weighted packing integer pro¬ 
gram. Maximize X]i=isubject to Yliss- ^ 1 < J < Xj € {0,1}. 

Wlog, we can assume that maxj Cj = 1. Although the above formulation appears somewhat restric¬ 
tive having a fixed right hand side of each inequality, the methods that we develop extend to more 
general parameters. Often the constraints are expressed as Cj : Vj ■ x < 1 where Vj is a 0-1 incidence 
vector corresponding to set Sj C {1,2... n}. We will also use Xi to denote the z-th coordinate of 
a vector x. Since this version is also NP-hard as it captures many intractable independent set 
problems, a common strategy is to solve the Linear Program (LP) corresponding to the relaxation 
0 < Xi < 1. Then use the LP optimum OPT to obtain a good approximation of the optimal integral 
solution. Suppose the optimum is achieved by the vector x' = \x'^,x '2 ■ ■ . x^] In the conventional 
randomized rounding |ET87| . we round each Xi independently in the following manner. 

Xj = {1 with probability x[ and 0 otherwise}. 

Since E[xj] = x^, it follows that E[^^CjXi] = using Chernoff bounds, we can show 

that for all j, w.h.p. < ( 10 ^^ ) • 

The primary motivation for improved randomized rounding is improved approximation of in¬ 
teger linear packing problems. Historically, the original randomized rounding technique was pro¬ 
posed for obtaining better approximation of multicommodity flows |RT87| . Srinivasan |Sri95[ [Sri99) 
presents an extensive survey of many sophisticated variations of randomized rounding techniques 
and applications to approximation algorithms. It was established much later |CGKT07] that the 
Raghavan-Thompson bound cannot be improved as a consequence of the following result. 

Lemma 1.1 I [CQKTn7) I There exists a constant d > 0 sueh that the integrality gap of the multi- 
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commodity flow relaxation problem is 11(1/ dfor any eongestion d, 1 < d < {6 log n)/ log log n 
where n is the number of vertices in the graph and the integrality gap for d is with respect to con¬ 
gestion 1. 

Therefore no rounding algorithm can simultaneously achieve error d = o(logn/loglogn), and 
an objective function value of VL[0PT) for all polynomial size input (m < Note that this 

does not preclude improvement of the multicommodity flow problem approximation by alternate 
formulation - for this, there are alternate hardness bounds given by [CGRTOT] . 

Subsequent to the work of Raghavan-Thompson, Srinivasan |Sri951 [Sri96] Baveja and Srinivasan 
[BSDO] . Kolliopoulous and Stein |KS98| obtained results that focussed on circumventing the above 
bottleneck by making use of some special properties of the constraint matrices like column-restricted 
matrices. In particular, the focus shifted to designing approximation algorithms that are feasible 
(unlike the basic independent rounding of RT) by bounding dependencies between inequalities and 
using Lovasz Local Lemma (LLL) or even more sophisticated corelation inequalites like FKG [Sri96) . 
Leighton, Rao and Srinivasan |LR,S98| made very clever use of LLL to achieve nearly optimal results 
in graphs with short flow paths. The notion of short flow paths was developed further in the context 
of the unsplittable multicommodity flow problem with an explicit objective function in many papers 
including |KR96[ IBSOOl IKS061ICCGK07] . Intuitively, short flow paths reduce dependencies between 
conflicting routing paths and makes the algorithms more efficient in terms of assigning paths without 
exceeding the maximum allowed congestion (for edge-disjoint paths it is 1). 

In this paper, we attempt to provide a uniform framework behind many prior uses of indepen¬ 
dent rounding by recasting the process as a Brownian walk in high dimension and analyzing its 
convergence. For readers familiar with this technique, it may appear to be an overkill since the 
final outcome of Brownian motion and independent rounding are identical (we shall formalize this 
in the next section). However, we will demonstrate that this framework offers a cleaner exposition 
and simpler explanations of many of the previous clever, yet ad-hoc techniques. We have presented 
a detailed characterization of the time-dependent behavior of the random variables associated with 
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each inequality executing Brownian motion which was not studied before to the best of our knowl¬ 
edge. It could potentially offer new tools for analysis of more complicated rounding methods where 
the random variables may not be independent. 

We demonstrate two distinct applications of our new framework by addressing the rounding 
problem for the average case of an input family that we will define more precisely. 

(i) A simple approximation algorithm for weighted objective function. 

(ii) An improvement of the Raghavan-Thompson rounding error for a restricted class of weighted 
objective function. 

As opposed to the one-shot rounding of |RT87| , also referred as independent rounding, our 
rounding is iterative and based on successive refinements of the LP solution that has a lower variance 
per step, to yield a sparser set of constraints. This reduces dependencies between inequalities and 
makes the use of techniques like Lovasz Local Lemma (LLL) more effective. 


1.1 Main results and some applications 

Theorem 1.1 Given an m x n 0-1 matrix A, such that A - x' <h for x\ € [0,1], and an objective 
function CiXi, in randomized polynomial time x' can be transformed into y' which are [0,1] valued 
random variables such that for all (log n — log log m -|- log b) > p >0 

(i) All constraints have < ^ non-zero variables from y' and 

(ii) For all i, Yi ' u' ^ where Ai is the i-th row of A. 

(Hi) E[cjy'] = YiCiXi 

In other words we have a sparse system of equivalent constraints with the objective functional value 
unchanged in the expection. 


Remark For p = log n — log log m -\- log B, there are log m variables in each inequality summing 
upto 0(1). This specific result has been observed before by using a clever application of independent 
rounding in the following manner. Round each to with probability x[ log m. Then 

E[xj] = x( and no inequality has more than logm variables with high probability. 

Using the above sparsification, we obtain a number of interesting results arguably in simpler 
ways than known previously. 

Theorem 1.2 Let denote the family of m x n matrices where each row (independently) has 

k ones in randomly chosen columns from {1,2, ...n} and 0 elsewher^. Let A G then for 

any point x' = (x{, X 2 .. .x'Y, 0 < x( < 1 such that A - x' < e”*, where e”* is a vector of m 1 ’s, and 
OPT = YiP ■ where 1 = maxjjci} and Vi Cj > Then, x' can be rounded to x £ {0,1}"" such 
that the following holds with probability >1 —^ 


ll^^lloo 

y^^cjXj 


^ Q f\og{mkp\ogm/n) -|-log logm 
~ Y log log (m/cp log m/n-|-logm) 

> n{OPT) 


and 


( 1 ) 

( 2 ) 


Moreover such an x can he computed in randomized polynomial time. 

Remark (i) The result relates the rounding error to the average number of ones in a column, i.e., 
^ and characterizes tradeoffs between the parameters m, n, k. For example, for k = -yn, we can 
bound the error to O(loglogn) for m < • polylog{n) for p < This is a significant 

improvement over the O( io^^g^ ) bound of the independent rounding. 

^Alternately we can set every entry to be 1 with probability k/n independently 
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(ii) For k = logn, the bound is indeed tight. A proof is given in the appendix f |Vis| i. 

(hi) For the unweighted case, the result holds for arbitrary distribution of the A: I’s in each rowjl 

Theorem 1.3 Let A E , such that x' E [0,1]”' maximizes and satisfies A - x <b ■ 

with OPT = YliPX^. // 1 = maxjCi > 1/p, p > 1 and m < then x' can he rounded to 

X E {0,1}"" in polynomial time such that the following holds with high probability for h>l 

||Ax||qq < b and (3) 

> Ll{OPT/{max{plogm,\og‘^{m))^^’^)) (4) 

i 

We observe a trade-off between the weights and the approximation factor (plogm)) ' . For b = 
^(logFig^fegm)’ obtain 0 ( 1 ) approximation for for Cj > ^ for any constant O > 1 . 

The random matrices provides a natural framework for combinatorial results related to random 
hypergraphs. We sketch one such application to 6 -matching of /c-regular hypergraphs that yields 
an approximation kf!^ for 6 > 2 using the result of Theorem 11.31 and extends a result of Srinivasan 
[Sri96j . This implies that for 6 = 0(log logn/log log logn), we can can obtain a 6 matching of 
size Ll{OPT) for most hypergraphs which matches the best possible size given by the fractional 
optimum. It is known that A-uniform 6 -matching problem cannot be approximated better than 
for h<k/\ogk unless P = NP [OFSlllIHS^ . 

1.2 An overview and related work 

Our algorithm can be best characterized as a randomized iterated rounding where we begin from 
a fractional feasible (specifically optimal) solution x' and iteratively converge to a good integral 
solution. Our algorithm has two distinct stages - random walk stage (more precisely. Brownian walk) 
and subsequently in the second stage invokes the Moser-Tardos iterative scheme for constructive 
Lovasz Local Lemma (LLL). In the first stage we effectively slow down the RT rounding process. 
Our approach is intuitive - starting from x', for each variable (dimension), we will roughly increment 
(actually a normal Gaussian increment) Xi by ±7 for a suitably chosen 1 > 7 > 0 where the sign 
(direction) is a random variable. In each iteration, the values of xfs are modified and we continue 
this process for each Xi until it is in the range [0, 6 ] U [1 — 5, 1] for an appropriate 1 > 6 > 0 such 
that 5 > 'j. At this point we fix the variable and we terminate when all inequalities have less than 
some predetermined value u of unfixed variables . This stage has some similarities with the method 
of Lovette and Meka |LM12| but our analysis requires completely different techniques. The crux 
of the method called partial coloring lemma is a rounding strategy of an arbitrary x E [— 
vector within the discrepancy polytope defined by the constraints starting with x = ( 0 , 0 , ... 0 ). 
Their method can be mapped to {0,1} rounding as well, that was observed by Rothvoss |Rotl3| . 
Compared to the setting of the discrepacy rounding, we are dealing with smaller error margins 
and the variable and polytope constraints are not widely separated in terms of distances from the 
starting point. Moreover, one needs to also account for the deviation of the objective function which 
is not required for Spencer’s discrepancy result. 

2 A Rounding procedure using random walks 

The simple random walk based algorithm outlined in the introduction doesn’t take into account 
any of the constraints Vj ■ x <1 and therefore likely to violate them after some random walk steps. 

^ Set each variable to 1 with probability 1/fc, and for each violated constraint that has more than q = log((mfc)/n) 
ones, zero all Its variables (or zero enough of its variables, chosen arbitrarily, so that it has only q ones). At most n/2 
of the variables are heavy in the sense that they appear in more than 2mk/n constraints. For a light variable, even 
if it comes np 1, the probability that it is a member of a violated constraint is small (say, below 1/2), and hence the 
light variables (using linearity of expectation) guarantee a value of Q{n/k). 

This proof sketch was given by an anonymous reviewer of an earlier version. 
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Algorithm Iterative Randomized Rounding 

Input ■. x'i 1 < i < n satisfying Ax < b ■ e’^. C°: set of constraints. 

0<7<5<1- the exact values are discussed in the analysis. 

Output : Xi € {0,1} 

Initialize all variables as un-fixed and set Xq = [si,* 2 , • • • x'n]- 
Repeat for iterations i =1,2... 

1. Generate a random vector = Ui where Ui is a multidimensional gaussian r.v. restricted 
to the unfixed variables. 

2 . Xi+i = Xi + 7 ■ ff. 

3. Fix a variable if it is less than 5 or greater than 1 — 5. 

If all variables are fixed then exit. 

{ * multiple variables may get fixed in a single iteration. * } 

4. Update the set of constraints C* that contains at least one unabsorbed variable, 
until stopping condition S {* maxj{ number of unfixed variables in Cj} < logn *) 

Run Moser-Tardos |MTin| algorithm on on the unfixed variables of Xt according given in Figure [2] 

Round the fixed variables to 0 or 1 whichever is closer and return this vector denoted by x. 

Figure 1: An iterative randomized rounding algorithm based on random walks 

However, the probability that an Xi reaches 1 before it reaches 0 is equal to the ratio = x[. 

This is a consequence of a more general property of martingales known as Doob’s optional stopping 
theorem f |Fel 68 | i. 

Theorem 2.1 Let (Q,S,P) be a probability spaee and {Fi} be a filteration of LI, and X = {Xi} a 
martingale with respeet to {Tj}. Let T be a stopping time such that Vo; G LI, Vi, |Xj(u;)| < K for 
some positive integer K, and T is almost surely bounded. Then E[A 7 ’] = E[A'o]. 

In the above application, the stopping times are Xt = {—b, a} whichever is earlier. So —b ■ Pr[A = 
— 6] + aPr[X = a] = 0 • Pr[X = 0]. Since Pr[X = —b] + Pr[A = a] = 1 from the stopping criteria, 
Pr[X = a] = In our context, we start from X = x^ and b = ^ and a = ^ , so the probability 

that Xi = \ equals ~ distributions of the variables being absorbed at 0 or 1 are 

identical to the independent rounding. However, the random walk process has many intermediate 
steps that do not have corresponding mappings in the 2 ” possible configurations of the independent 
rounding. Whether it makes this framework more powerful compared to one-step independent 
rounding is a difficult question but we feel that it could give us a superior understanding of the 
process of independent rounding. 

In the algorithm presented in Figure [H instead of the simple Bernoulli random walk step, we use 
a normal Gaussian random walk on each coordinate. We run the basic algorithm for T iterations 
such that the number of un-fixed variables in each constraint of (the active set of constraints 
after T iterations) is bounded by logn. The value of T, more specifically E[T], will be determined 
during the course of our analysis. 

Now we run the algorithm of |MT10| (c.f. section 14.11) on the constraints in C'^ which are 
projections of the original constraints on the unfixed variables after T steps. 

2.1 The framework and some notations 

In the rounding algorithm, Xt denotes the random walk vector after t steps. Let LT^ denote a 
d-dimensional Gaussian random variable and let Ut denote the projection of t/” on the current 
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subspace corresponding to the unfixed variables in the tth iteration. The normal distribution is 
denoted by where /r is the mean and is the variance. Unless otherwise stated, we will 

refer to the standard normal distribution where /r = 0 and = 1 . 

The parameters 7 , 6 are chosen to ensure that the walk stays within the feasible region. It 
suffices to have 7 < from the pdf of the normal distribution if we are executing 0{^) steps 
(see |LM12) for a rigorous proof). The value of 6 will be determined according to the approximation 
factor and the error bound that we have in a specific application. In our analysis, the the focus will 
be on variables absorbed at 0 that will be rounded down. The decrease in the objective value can 
be at most n5 (recall the maximum weight of the coefficients is 1). If we choose 6 such that n6 is 
o{OPT) it will suffice. For the main theorems, choosing 5 < will work. 

After t iterations, the j-th constraint is denoted by Cj :< Vj,Xt > < 1 + Pt that has many of 
variables fixed. We shall denote the set of unfixed variables in iteration t hy U{t) C {1, 2 ... n} and 
the corresponding vector by where the the fixed variables are set to 0. The constraint vector 
Vj = Vj r\U{t), where only the coefficients in Vj corresponding to V({t) can be non-zero. Then 

I < Vj, Xt — Xfj^i > I = I < ||y^ || ,XY — > I • I |hjl I2 the change in the value of Cj in iteration 

t as the remaining variables do not change. Note that while X^ changes in every step, Vj changes 
only when some variable is absorbed. 

For convenience of the analysis, we will club successive iterations into phases, where within a 
phase p, fip remains unchanged. Equivalently, fip — fip-i reflects the cumulative effect of a number 
of random walk steps within the phase p referred to as the accumulated error or simply error. The 
phase p corresponds to the L 2 norm of any constraint ||F^^||2 that is bounded by ^nl‘2P. Intuitively, 
with additional random walk steps, we are more likely to violate the original constraints and fip is 
a measure of the violation. In terms of the above notation, it is obvious that — I|h^^ll 2 ' 

We will use the index p ( respectively t) to indicate reference to phases (resp. iterations). 

Wlog, we assume that all constraints have at least 2 variables and at most n — 1 variables. 
H The successive Brownian motion steps (defined by multidimensional normal Gaussian) form a 
martingale sequence. The following result forms the crux of our analysis - see [BanlOl ILM12| for 
more details and proof. 

Observation 2.1 In iteration t, let Yt =< Vj,Ut ■ 7 >, that is the measure of the change in 
<Vj,x > in the step t. Then Yt is a Gaussian random variable with mean 0 and variance 7 - ||ljj| 2 . 

Note that the scaled random variable Yf = is a Gaussian with mean 0 and variance < 1. 

When Yf correspond to standard Gaussian, then 

Lemma 2.1 For any /3 > 0, Pr)!!"/ + Y 2 ... Yf \ > /3] < 2exp(—/3^/2T). 

The behavior of random walks starting from an arbitrary initial position and subsequently 
absorbed at 0 can be obtained from the gambler’s ruin problem where the underlying martingale is 
the Brownian motion. There exists a wealth of literature on Brownian motion |Fel 68 l iRosOb) . but 
the specific form in which we invoke them for analyzing our algorithm is stated below. A proof is 
presented in the appendix. 

Lemma 2.2 Consider a random walk starting from position a in an interval of length a + h with 
absorbing barriers at both end-points. Then the expected number of steps for the walk to get absorbed 
(at any of the ends) is a ■ b. Moreover, the probability of the random walk being absorbed at 0 is 
— Ijk after k ■ a ■ b steps for any k > 1. 

Remark By choosing b > k ■ a, the probability can be made arbitrarily close to 1 for /c S> 1 - 
conversely, the probability of non-absorption at 0 is O(^) after /c^a^ steps. 

constraint with n variables has a trivial solution where any one variable can be set to 1 and a constraint with 
one variable is redundant. 
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3 Brownian walk analysis 


We will divide the analysis into two components - First we will compute the rate at which the 
variables are absorbed at 0. Second, we compute the increments of all variables during each iteration 
that causes an inequality to be violated. This causes the right hand side of any inequality to behave 
as a martingale and we will refer to it as the error. Note that the increase in error is related to 
the number of unabsorbed variables as they execute random walk. Once all variables are absorbed, 
then the error doesn’t change. For subsequent applications, we will derive the bounds for starting 
position scaled by S' > 1, i.e., from Readers who are familiar with |LM12) may note that our 
analysis focuses on variables associated with each constraint as opposed to the global number of 
variables. 

Lemma 3.1 Let x' G M”' be a feasible solution to A- x < B ■ 1"*, A G {0, i and x = ^, S > 1 

i-w-> l-w-l OP OP 

be chosen as the starting point for brownian walk. Then, after Tp = steps, with probability 

> 1 — , the number of unfixed variables in constraint is ) for 2 ^ < g(iog^))) EI- @ 

First, we can use Lemma 12.21 to get a bound on the probability of absorption. 

2 2 

Claim 1 For b > ra, after Brownian motion steps, the probability of non-absorption at 0 

is < 0 (i). 

The above claim is trivially true for r < 1. In Lemma [2.21 . use k = r,b = ra/j that gives absorption 

2 2 

probability < 0 (l/r) after r • a /7 • ar /7 = steps. 

Proof: (of Lemma [3T]) Choosing Tp = 

we can apply Claim [1] with r = to obtain that the probability that Xi is not absorbed at 0 is 

nxi 

< Let VL X < R be the i^^ constraint after p phases. If (= denotes the number 

of unabsorbed variables in constraint i after p phases then since 

Z^j=i — s • 

Note that the variables are independently executing Brownian walks. For > log (m) or 
equivalently, p < log(n) — log(log(m)) + log(i?) — log(S'), we can apply Chernoff bound to claim 
that is 0{^^) with probability > 1 — ^ □ 


Corollary 3.1.1 Since E{uf) = E{\\VP\\l) for ¥[ = {0,1}”, we can bound ||Vi ^||2 by with high 
probability for p < log(n) — log(log(mlog(n))) +log(i?) — log(5). 

Remark (i) For Aij G [0,1], the above proof on E(u^) doesn’t hold directly but the bound on 
IIII 2 is still valid. 

(ii) If we proceed upto p* = log(n) — log(log(m)) + log(R) — log(S') — log(c), all constraints have at 
most O(logm) unfixed coordinates. 

3.1 Error Bound 

From the previous result, for all with high probability. To bound the error 

consider a fixed constraint Cj : Vj.x < B, we denote the error accumulated in the phase by 5p, 
so that Yfq=l = f^p- 

^For m polynomial in n we need not distinguish between log m and log n 

^Applying a union bound over all constraints, we satisfy the conditions of the lemma for each of the phases and 
constraints with high probability 
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Lemma 3.2 For all constraints Cj, the error 6p < y • 

Thus the total error (3p upto logn —log(log m)+log(i?) — log(5') is bounded by c'^ for some eonstant 


Proof: Consider Pr(< 7(X]j=T^_^+i Ui), V^- > | > 6p) 
IT- ^ I > 


=Pr(| < E-Z 


2 —Tp—l + l 


Now < C/i, ^ > Af(0,a^) where < 1. From Lemma l2.1[ it follows that the above is 

bounded by exp(— ^a||y.||i2(T -T —I))' ^ hold simultaneously for all the m constraints in the 

phase, if 6p satishes 

7^ 11V, 11 ■^2{Tp-Tp_ 1 ) > ^ (log _ 

i.e. if Sp > 'y\\Vj\\-s/2 ■ logmfTp - Tp_i) 

From Corollary [3JJJ 1 1 11 ^ and Tp = 

it suffices to choose 


> 7 


nB 


S ■ 2P 


— ■ v^ 2 log 


m 


2P ■ 2P 

fl2^2 


= 2 


B log m2P 
S-n 


The above bound for 5p holds with high probability when p < logn — log(logm) + log(i?) — log(S') 
since only in this situation we can bound effective value of Vj. 

Thus the total error upto logn — log(logm) + log(i?) — log(5') is bounded by 


logn-log(logm)+log(B)-log(5) 

E 

p=i 


, B log m p 
2\l ^^ 22 < 2 
b ■ n 


= 2 


B log m , logn-log(logm)+ iogB-log(S) , 


B logm 
S-n 


0(1 


—JZE —) = o{—) 

S-\ogm’ 


(5) 

( 6 ) 


Therefore, the total error in every constraint is bounded by < Vj,xP >=< Vj,x + 'jEiZi >= 

0 (f). □ 

Thus the solution obtained satishes the constraints A • x < f. 

The above method can be extended to obtain an alternate proof the 0(logm/log logm) error 
bound of Raghavan-Thompson that we have ommitted from this version. 

Based on the above results, we summarize as follows. 


Corollary 3.2.1 In the first stage of Algorithm Iterative Randomized Rounding, we run the 
algorithm for T = g 2 ^^2 steps. Then with high probability 
(i) all constraints have < logm unfixed variables and 

(a) the total error in any constraint is bounded by c'B/S for some constant d. 

Using multiple eopies of a variable, results similar to this section have been used before that 
are ad hoc to specihc applications [CC091 ISri96| . However, it requires two stages of rounding - 
once choosing exactly one copy followed by a phase of independent rounding. In comparison, our 
technique and analysis are more general. 


4 Applications using LLL 

At the end of brownian walk(that is after ^ 210 ^ 2^.^2 steps), we have atmost log(m) unconverged 
variables occurring(with non zero coefficients) in each equation. Now we need to bound the error in 




































independantly rounding the unconverged Now we need to bound the error in independantly round¬ 
ing the unconverged variables. As per the notations setup in section [2l V^* represents the coefficient 
vector of constraint after p* phases. 

A 

Define A as a matrix having rows as V[*. 

A 

Now if the unconverged variables are rounded from x to x, the change in the value of RHS is 

A ■ {x — x) = A ■ {x — x). (Since only unconverged variables change). 

Hence to calculate additional error due to independant rounding we only need to consider the 
A 

"‘sparsified"’ matrix A. 

Before we prove the main results, we consider the simpler case of columns with bounded number of 
ones - Suppose the matrix has no more than p I’s in any column. Let OPT be the optimal 

fractional objective value for the weighted objective function Ci ■ Xi. Consider a fixed constraint 
Cr, that contains logm I’s after the Brownian motion and let jiir), ■ ■ ■ denote the (at most) 
logn columns that contain 1. We say that a constraint Cy is dependent on Cr if they share at 
least one column where the value is 1. So, the dependency of any single constraint can be bound 

A 

by plogm (Since there are logm I’s per row in the sparsified matrix A). If we use the independent 
rounding to round the fractional solution xh the probability that the value of a constraint exceeds 
t > 1 is bounded ^ from Chernoff bounds Ij. Let Ei denote the event that Ci exceeds t when we 
use randomized rounding. We are interested to know the probability of the event since 

this implies the event that all the inequalities are less than t. This is tailor-made for Lovasz Local 
Lemma (LLL). We also want to guarantee a large value of the objective function. For example, 
setting all variables equal to zero would guarantee feasibility but also return an objective function 
value 0. Thus we define an additional event A^+i corresponding to the objective function value 
less than (1 — e) • OPT for some suitable 1 > e > 0. Since A^+i is a function of all the variables, it 
has dependencies with all other Aj i = 1.. .m, therefore we have to use the generalized version of 
LLL in this case. 

Theorem 4.1 (Lovasz Local Lemma |PL75| I Let Aj, 1 < i < N be events such that Pr[Ai] = p 
and each event is dependent on at most d other events. Then if ep{d -|- 1) < 1, then 
Pr(Ai n A 2 ... An) > 0. 

Alternately, in a more general (asymmetric) case, where the dependencies are described by a graph 
({1,2... N},E) where an edge between i,j denotes dependency between Ai,Aj and yi are real num¬ 
bers such that Pr(Ai) < yi ■ ]{[(*,j)e£;(l - Vj) then Pr > OilAl “ Vj)- 

Moreover, such an event can be computed in randomized polynomial time using an algorithm of 
Moser and Tardos JMT 1 d)/ . 

If we choose t such that e • • (^ + 1) — 1 equivalently t = logd, then we can apply the 

previous theorem to obtain a rounding that satisfies an error bound of 0(logp -|- log logm). The 
error t can be improved to by using a tighter version of the Chernoff bound (Equation 

USD. 

We define A^+i as the event where the objective value is less than (1 — e)OPT. From Chernoff- 
Hoeffding bounds, we know that Pr(Am+i) < exp(—e^OPT/2). We define yi for i = 1, 2 ... m as 
before, corresponding to the probability of exceeding t = logd/log logd. 

By choosing yi = l/(Q:d) i < m and Pm+i = for some suitable scaling factor a > e, we must 
®This is a slightly weaker version to keep the expression simple 
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Algorithm LLL based Iterative Randomized Rounding 

Input : x'i 1 < i < n,t (error parameter) 

Output : Xi £ {0,1} 

Do independent rounding on all the variables having values x'i to Xi. 

Compute the value of each constraint Ci as < Vi,x > 

While any inequality exceeds t or objective value is < OPT /2 

1. Pick an arbitrary constraint Cj that exceeds t and perform independent rounding on all 
the variables in Vj. 

2. Update the value of the constraints whose variables have changed 
Return the rounded vector x. 


Figure 2: An iterative randomized rounding algorithm based on Moser-Tardos 


satisfy the following inequalities 

— l/{ad))'^ ■ - i = l,2...m (7) 

Pr(Am+i) < ^(1 - 1/iad))'^ < ^ • exp(-m/(ad)) (8) 

The first condition is easily satisfied when Pr(Aj) = 

To satisfy the second inequality, we can choose a so that OPT > Q(^). This implies that 
exp(—m/(da) > exp(—e^OPT/2)), so condition ([SD is satished for LLL to be applicable. 


We summarize our discussion as follows 


Lemma 4.1 For A G {0, with a maximum of p I’s in each column, we can round the 

optimum solution x* of the linear program max^^^jCjXj s.t.Ax <1 ,0 < Xi < 1 to x G {0,1}"" 
such that 


\\Ax\ 


- (1—i —n -T ) ’ i— n ( ^ 

V log log [p log m) J log(log( 


and CjXj > (1 — e)OPT 

i 


Proof: The error bound follows from the previous discussion by setting t/j = and using the 

stronger form of Chernoff bound in Equation 1181 Note that Equation [8] can be satisfied by ensuring 
OPT > ^ by choosing an appropriately large a. 

The above is satished for a > Qfff i-e. if a.d = 

In this case the discrepancy is 

Hence the result follows. □ 


The objective function still follows the martingale property since the variables starting the 
random walk at x[ have probability x[ of being absorbed at 1 which is identical to the independent 
rounding that we use in the second phase for Moser-Tardos algorithm. Although, the random 
walk is short-cut by a single step independent rounding, the distribution for absorption at 0/1 
remains unchanged. To see this, consider the last time a variable Xi is rounded by the Moser-Tardos 
algorithm - the probability is x^ since it is done independently every time. 

4.1 Random matrices 

Let denote the family of m x n 0-1 matrix with exactly k Is in each of the m rows chosen 

uniformly at random. Clearly = 1/A: is a feasible solution with objective function value n/k. 
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After rounding x' to x, we want to achieve an objective value Vt{nlk) Is in the rounded vector. In 
addition, A - x' <t ■ for error guarantee t. 

To compute the dependency d for Cr, we observe that another constraint Ci will contain a 1 
in ji(r) with probability i.e., if it had one of the k randomly chosen I’s in that column, which 
is Since all the rows were chosen independently, the expected number of rows among m rows 
that have a 1 in column ji(r) can be bounded by ^ and by 0(max{^,logm}) with probability 
greater than 1 — 1/m. Since this holds for all the positions, ji{r), by the union bound, and using 
max{a, b} < (a + 5), we can claim the following 

Claim 2 The total number of constraints that are correlated to Cr can be bound by _|_ 

log^ m) with high probability. 


In our context, for k = log m, d = "^ Tlog m^). We can verify this by exhaustively computing 

the dependence - if it exceeds this then our algorithm is deemed to have failed which is bounded 
by inverse polynomial probability. If we choose t such that e • • (d + 1) < 1 or equivalently 

t = n(log( ™'^°^ + log^m)), then we can apply the previous theorem to obtain a rounding that 

satisfies an error bound of 0 (log( ”^^°^ ^ + log^m)). 

For m bounded by nlog*^^^^ m this is O(loglogm) which is substantially better than the Raghavan- 
Thompson bound. 

However for m > for any constant e > 0, it is no better. 

Proof: (Completing the Proof of Theorem ll.2p Now we formally define our bad events Ei here. 

VI < i < m, define Ei = {Ajx > d).(where we choose error to be d). Define E^+i = (c^.x < 

(1 — e)OPT). We have Pr{Ei) < and Pr{Em+i) < e ^ . We choose y* = ^,for 

some a > 1. and choose j/m+i = To apply LLL we need, Pr{Ei) < ^(1 — ^)'^-(l ~ 5 ) and 
Pr{Em+i) < ^(1 - To satisfy these equations we get error,(5 = 

-e^.OPT -m 

Also we get e 2 < 


^og{log{a.d)) • 

The second equation is satisfied for OPT > or a.d > 


J. l.XOVy VV O . L.4- ^ L-LCX U XV-/J.X XO OCO O XOXXV^vJ. XV-/X V.y' J. -X ....-^ ^ • 

Combining the 2 results we have error 6 in constraints as max i , I 

& \ log(log(d)) ’ log(log( Q^)) J 


OPT- 


which is max 


log( )+log log m \ loglfTpr) 

logllog/oPy)) 


When Cj > i we have OPT > Sub- 

'' — p — k.p 


log log( log m) 

stituting the value of OPT in the equation and using the fact that — = 0(log(m)), we prove 
1 ^—■ 2 
Theorem 11.21 The expected running time for the second phase is 0{^polylog{n)) in our case, 

due to application of Moser Tardos. 

Here the error bound may fail for the additional reason that the dependence may exceed 

after the Brownian motion phase. This can happen with probability ^n(i) ■ This part is related the 

input distribution of and repeating the algorithm may not work. □ 


Remark A direct application of LLL (without running the Brownian motion) would increase the 
dependence to _|_ log mlog n). In order to maintain the same asymptotic error bound, 

the number of rows in the matrix, m, could be significantly less. For example, if m, /c = then 
the difference would be 0(logn/loglogn) versus O(log log n/log log log n). 

The proof of Theorem 11.31 is along similar lines after appropriate scaling and is given in the 
appendix. 


5 Applications of our rounding results 

In this section we briefly sketch the applications of our rounding theorems. Although these do not 
significantly improve prior results, our parameterization could simplify further applications. 
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5.1 Application to Switching circuits 

Consider an re-input butterfly network (with re log re total nodes) with i < re log re source 

destination pairs where each input/output node has log re sources and log re destinations. For any 
arbitrary instance of routing, we can do a two phase routing with a random intermediate destination, 
say, we choose a random intermediate destination for the source-destination pair We want 

to route a maximum number of pairs subject to some edge capacity constraints. 

If re = 2^, then the the expected congestion k of an edge for a random permutation is log re • 
— = log re and moreover it can be bounded by clog re with high probability for some constant c. 
So there exists a fractional solution with flow value for each of the re log re paths with objective 
function value re. 

Let ^4 be an rre X t matrix where rre is the number of edges and t = n log re is the number of paths. 
The edges are denoted by ci, 62 ... and the paths are denoted by Hi, 112 ... lit. Then Aij = 1 iff 
the flow lij passes through edge e,. The number of edges in a path is bounded by L = log re. The 
value of the flow through path Hj ( denoted by fj) is the amount of flow and let / be the vector 
denoting all the flows. Then A ■ f < c where c = (ci, C 2 ... Cm) is the vector corresponding to the 
congestion in edges (ci, 62 ... Cm)- 

Since the congestion is bounded by clog re, A G and there exists a fractional solution 

/ = (1/clogre, 1/clogre... 1/clogre with objective value Q.[n). Using p = O(logre) in Lemma l4.ll 
we can round it to a 0-1 solution that yields the following result. 

Theorem 5.1 In an re input BBN butterfly network having 2(relogre) edges, we can route the Il{n) 
source-sink pairs with congestion 0(log log re/log log log re). From the capacity constraint, it follows 
that no node contains more than O(loglogre) sources or destination. 


Note that we can also handle weighted objective functions. 

The above result matches the previous results of |MS99l which have the advan¬ 

tage of being online. Using B = log log re/log log log re, in Lemma I A. 21 we can obtain an opti¬ 
mal bound. Note that, in this case we can match asymptotically the optimal fractional flow of 
re log log re/ log log log re) with Xi = eloglolSlogn • 


Theorem 5.2 In an re input multi-butterfly network having re log re edges, we can route iog°og°og’^ ) 
flows with congestion 0 (log log re/log log log re) where each node can be the source or sink of at most 
0 (log log re/log log log re) flows. 


The above result marginally extends a similar result of Maggs and Sitaraman |MS99) where they 
can route (re/log^^'^re) pairs in the online case. It remains an open problem to find a fast online 
implementation of our rounding algorithms. 


5.2 Mciximum independent set of rectangles 

Consider a ^/nx^/n grid and a set of axis-parallel rectangles that are aliened with the grid points,i.e., 
the upper-left and the lower-bottom corners are incident on the griqj- Each rectangle contains A 
grid points for some A < n but they can have any aspect ratio. Trivially, the different types of such 
rectangles can be at most A where A = a ■ b for a = 1,2... A. Two such rectangles ri and r 2 are 
overlapping if r* H r 2 contains one or more grid points. 

From the A possible rectangles whose upper-right corners are anchored at a specihc grid point 
p, the input consists of one such rectangle. To avoid messy calculations, we assume that the grid 
is actually embedded on a torus. Given a set 5 of re such rectangles, our goal is to select a large 
non-overlapping set of rectangles. Although this is a restricted version of the general MISR problem 
it still captures many applications. 

can assume that all the grid-points that are flush with the sides are in the interior by slightly enlarging the 
rectangle 
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For any given grid point p, let S{p) denote the set of rectangles containing p. This can be 
bounded by a x A/a = 0{A‘^). After solving the relevant packing linear program, the n x n 

point-rectangle matrix A contains 1 in the {i,j) position if the i-th grid point is incident on the 
j-th rectangle. From our previous observation, no row contains more than I’s. 

By setting k = we can apply our rounding results to obtain a wide range of trade-offs for this 
problem. For example, if A is polylog(n), then using p = A in Lemma 14.11 we can choose kl{OPT) 
rectangles with no more than 0 (loglogn/logloglogn) overlapping rectangles on any clique, where 
OPT is the fractional optimal solution of the corresponding packing problem. 

Further, using a result of [LENO04] . we can obtain non-overlapping rectangles 

as well as a l°gl°g ^ approximate solution in the weighted case. 

log log log n ^ ^ ^ 

5.3 6-matching in random hypergraphs 

In a hypergraph H on n vertices has hyper-edges defined by subsets of vertices. We wish to choose 
the maximum set of hyper-edges such that no more than b hyperedges are incident on any vertex. 
An integer linear program can be written easily for this problem with n constraints and m variables 
corresponding to each of the m edges. Let A be an n x m matrix where Aij = 1 if vertex i is 
incident on the j-th hyperedge. Let Xi = {0,1} depending on if the f-th hyperedge is selected in 
the 6 -matching. 

Maximize Xi s.t. Ax < 6 • e™ G {0,1} 

i 

The relaxed LP can be solved and the solution may be rounded. The weighted version can be 
formulated similarly by associating weights Wi for Xj. 

If the m edges of H are random subsets of vertices, then we can represent the fractional solution 
as an n X m random matrix with each entry of the matrix set to 1 with probability k/n. That is, 
each hyperedge has k vertices and chooses k vertices randomly. So the expected number of edges 
incident on a vertex is There exists a feasible fractional solution using Xj 1 < f < m = ^ 
with objective value OPT > n/k. 

For 6 = (log log n/log log log n), we can can obtain a 6 matching of size 0{OPT) for most 
hypergraphs which matches the best possible size given by the fractional optimum. In general, we 
obtain an approximation k^^^ for 6 > 2 using the result of Theorem 11.31 including the weighted 
version. 

It is known that /c-uniform 6 -matching problem cannot be approximated better than for 

h < k/ log k unless P = NP [OFSlll IHSS06) . So our result shows that the bound can be much 
better for many /c-uniform hypergraphs, for example k = logn and 6 > 2. A closely related result 
for bounded column case was observed by Srinivasan |Sri96| . Acknowledgement The author would 
like thank Antar Bandhopadhyay for pointing out the generality of the Optional Stopping theorem and Deeparnab 
Chakrabarty for pointing out the result in |CGKT07] . A lower-bound construction was given by Nisheeth Vishnoi 
and Mohit Singh that refuted an earlier stronger claim by the author. The authors also acknowledges Nikhil Bansal 
for some useful observations with regards to formalizing the main theorem. 
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A Appendix 

Proof of Lemma 12.21 
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Proof: The proof of the gambler’s ruin problem actually uses a quadratic martingal^ ~ t 

where B{t) is the Brownian motion random variable. Using the optional stopping theorem on this, 
we obtain 

IE[B 2 (r) - T] = E[.B 2 ( 0 ) - 0 ] = 

Since p = is the probability that it is at 0, we obtain E[r] = a ■ b. 

Now, consider the events leading to the failure of being absorbed at 0 - these correspond to the 
absorption of the random walk at either ends or the non-absorption at either ends. 

Pr[Failure] = Pr[Failure n Absorption] + Pr[Failure n Non — absorption] 

= Fic[Failure\Absorption] ■ Pr[Absorption] -|- Pr[Failure\Non — absorption] ■ Pr[iVon — absorption] 

< Pj:[Failure\ Absorption] Pr[iVon — absorption] 

From the preceding argument ,the probability that the random walk is not absorbed after ka ■ b 
steps is less than 1/k using Markov’s inequality. From optional stopping criteria, the probability 
of absorption at 0 is The probability that the random walk is not absorbed at 0 after kab 

steps is bounded by 1 — + l/k. Consequently, the probability of being absorbed at 0 is at least 

^ - lA- □ 


A.l A Lower Bound on error 


Lemma A.l For A G such that A ■ x < e™', we cannot simultaneously obtain ~ 

0(n/logn) and \\Ax\\^ < o( ioS°iogrx ) ^ {OA}- 


Proof: 

As mentioned before, it is easy to see that Xj = 1/A; is a solution for x with objective function 
value n/k. After rounding x we want to have at least Pl{n/k) Is in the rounded vector, say y. We 
must have A ■ y < t ■ e for some rounding guarantee t. Let 6 be a fixed vector with n/k Is. 

Let r be a row vector with k Is in random location - this corresponds to a row of A. The 
probability that < r,y > > t is given by 


({n/k)^ . ({n-n/k)-^ 


V i J 


o 


Using < (^) < the above expression is 


> 


> 


f n \t f n — nik * 

Ktk^ A k-t ^ 

/ ne \k 

k ' 

1 / k — 1 — 

? 

f^k — t 
1 

-- ,,1/2 , , for A: > t 

1 

ek-t/l.p. 


“The proof that it is a martingale can be found in standard books on stochastic process 


(9) 

( 10 ) 

( 11 ) 

( 12 ) 

(13) 

(14) 
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So the probability that for m independently chosen rows, the probability that < r,y > < t is 


p{m,k,t) < 
< 


1 - 


1 


gfc —£/2 . 
m 

\ \ fkft 

e 









ft 


(15) 

(16) 


Since there are no more than choice of columns with n/logn Is, the probability that all 

the dot products are less than t is less than ■ p{m,k,t) < p{m,k,t) ■ < 

e” • p{m, k, t). If e” • p{m, k,t) < 1 then there must exist matrices in that do not satisfy the 

error bound t. So, 

m 

( 1 \ m 

.e”^(e)^>e^ 

which is equivalent to the condition 

— > or log(m/n) > k + t\ogt (17) 

n 

For k = logn and m = n ■ polylog{n), this holds for some t > be., for some constant 

a > 0, i.e., the error cannot be □ 


A.2 Proof of the general approximation bound: Proof of Theorem 11.31 

If we are interested in maintaining feasibility of constraints, we can ensure it by sacrificing the value 
of the objective function according to some trade-offs. The same algorithm works, where we use 
the known method of damping the probabilities of rounding 1 |RT87[ ISri95) i. 

Let Xi , X 2 ■ ■ ■ Xu denote the unabsorbed random variables in a constraint after the Brownian 
walk phase and let U = 'YhiXi = ^ for some /? > 1. Let Xi be the {0,1} random variables where 
Pr[Xi = l]= Xi and so • Xi] =Y^.Xi = j. To apply LLL, we need Pr[X)j A, > S] < 1. 

Since U = Xi, then from Chernoff bounds, we know that 


Fi[U > (1 + A)E[U] < 


lE[U] 


(1 + A) 


l+A 


(18) 


Using E[U] = ^ and /3 = 1 -|- A we obtain Fi[U > B] < 
parameter a > 1. It follows that 

{l3le)^ > da 




V W J 


B/p 


- 


B 


< IS 


for some scale 


(19) 


Therefore /3 = e{da)^^^. Since our matrices are 0-1, the minimum infeasibility happens at i? -|- 1, so 
we can substitute B + 1 instead of B in the previous calculations to obtain /3 = . Using 

S' > 1 as the initial scaling for Brownian walk, we obtain the following result using Lemma [3.21 


Lemma A.2 For B > 1, we can round the fractional solution to a feasible integral solution with 

1 1 

objective function value ^{OPT/S) for S—max({ Jf,rp )B -^, ). If B ^ Z,S can be replaced by 

B+1 in above exprssion and proof 

Proof: If we start from x = ^, the solution produced by brownian walk satisfies the constraints 
with RHS ^(as shown in Lemma 13.21) . 

Now we define events for LLL similar to the proof of Lemma [4.11 
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VI < i < m define Ei = {Af .x > B). 
and define E^+i = -x < (1 — e).OPT. 

As shown above, VI < z < m, Pr{Ei) < (^)^- 

Also by chernoff’s bound Pr[Em+i) < . 

We apply LLL with weights Vi = ^) for 1 < z < m 
and ym+\ = 

, for some appropriately defined a > 1. 

Now we need (|)^ < ^(1 - 

— e'^OPT . . T m 

and e 2/3 < i _ ^)™- < i.e“'^^(From Equation ([8])) 

The above is satisfied for ^ 

Thus we need to choose 

(a.a) B 

OR/3 = (a.d)fo^ > (^)^. 

Also a > l.For B G Z, it would have sufficed to use R' = i? + 1 as opposed to choosing B which 
proves the stated lemma. □ 

To prove Theorem ll.3[ we can observe that for Ci > we have OPT > For ^ = 0(log(m)), 

2 1 

we get the scaling as mox((log(m))'B, (plog(m))'B), which proves Theorem 11.31 


17 






